This paper presents a method that achieves fast adaptation for a class of model-reference adaptive
control. It is well known that standard model-reference adaptive control exhibits high-gain control behaviors
when a large adaptive gain is used to achieve fast adaptation in order to reduce the tracking error rapidly.
High-gain control creates high-frequency oscillations that can excite unmodeled dynamics and can lead to instability.
The fast adaptation approach is based on the minimization of the squares of the tracking error, which is
formulated as an optimal control problem. The necessary condition of optimality is used to derive an adaptive
law using the gradient method. This adaptive law is shown to result in uniform boundedness of the tracking
error by means of Lyapunov's direct method. Furthermore, this adaptive law allows a large adaptive gain
to be used without causing undesired high-gain control effects. The method is shown to be more robust than
standard model-reference adaptive control. Simulations demonstrate the effectiveness of the proposed method.


I. Introduction 

In recent years, adaptive control has been receiving a significant amount of attention. The Aviation Safety Program
under the NASA Aeronautics Research Mission Directorate (ARMD) has established the Integrated Resilient Aircraft
Control (IRAC) research project to advance the state of the art in adaptive control to enable flight control resiliency in
the presence of adverse conditions [1]. There has been a steady increase in the number of adaptive control applications
in a wide range of settings such as aerospace, robotics, and process control. Research in adaptive control continues
to receive attention from government agencies, industry, and academia. In aerospace applications, adaptive control
has been developed for many flight vehicles. For example, NASA conducted a flight test in 2006 of
a neural net intelligent flight control system on board a modified F-15 test aircraft [2]. The U.S. Air Force-Boeing
team has successfully developed and completed numerous flight tests of direct adaptive control on Joint Direct Attack
Munitions (JDAM) [3]. The ability to accommodate system uncertainties and to improve fault tolerance of a flight control
system is a major selling point of adaptive control. Nonetheless, adaptive control still faces significant challenges in
providing robustness in the presence of unmodeled dynamics and parametric uncertainties. The crash of the X-15
aircraft in 1967 [4] serves as a reminder that adaptive control is still viewed with some misgivings despite enormous
advances in this technology since then. The ability of an adaptive control algorithm to modify a pre-existing control
design is considered a strength and at the same time a weakness.

Over the past several years, various model-reference adaptive control (MRAC) methods have been investigated [5-11,13,14].
The majority of MRAC methods may be classified as direct, indirect, or a combination thereof. Indirect adaptive
control methods are based on identification of unknown plant parameters and certainty-equivalence control schemes
derived from the parameter estimates, which are assumed to be their true values [17]. Parameter identification techniques
such as recursive least-squares and neural networks have been used in indirect adaptive control methods [7]. In contrast,
direct adaptive control methods directly adjust control parameters to account for system uncertainties without
identifying unknown plant parameters explicitly. MRAC methods based on neural networks have been a topic of great research
interest [8-10]. In particular, Rysdyk and Calise described a neural net direct adaptive control method for improving
tracking performance [8]. This method is the basis for the intelligent flight control system that has been developed for
the F-15 test aircraft by NASA. Johnson et al. introduced a pseudo-control hedging approach for dealing with control
input characteristics such as actuator saturation, rate limit, and linear input dynamics [10]. Idan et al. studied a
hierarchical neural net adaptive control using secondary actuators such as engine propulsion to accommodate failures of
primary actuators [11]. Hovakimyan et al. developed an output feedback adaptive control to address issues with
parametric uncertainties and unmodeled dynamics [14]. Cao et al. developed an adaptive control method to address high-gain
control [12].

While adaptive control has been used with success in a number of applications, the possibility of high-gain control
due to fast adaptation can be an issue. In certain applications, fast adaptation is needed in order to improve tracking
performance when a system is subject to a large source of uncertainties such as structural damage to an aircraft that
could cause large changes in aerodynamic derivatives. In these situations, a large adaptive gain or learning rate must
be used in the adaptive control in order to reduce the tracking error rapidly. However, there typically exists a balance
between stability and adaptation. It is well known that fast adaptation can result in high-frequency oscillations which
can excite unmodeled dynamics that could adversely affect the stability of an MRAC law. Recognizing this, some
recent adaptive control methods have begun to address high-gain control, such as the $\mathcal{L}_1$ adaptive control [12] and a
hybrid direct-indirect adaptive control [15]. In the former approach, the use of a low-pass filter effectively prevents
any high-frequency oscillation that may occur due to fast adaptation. In so doing, the reference model is no longer
preserved and instead must be reconstructed using a predictor model. In the latter approach, an indirect adaptive law
based on a recursive least-squares parameter estimation adjusts the parameters of a nominal controller to reduce the
modeling error, and the remaining tracking error signal could then be handled by a direct adaptive law with a less
aggressive learning rate.

This paper introduces a new approach to fast adaptation in the MRAC framework. The method is formulated as an
optimal control problem to minimize the tracking error $\mathcal{L}_2$-norm. The optimality condition results in a modification to
the MRAC law by introducing a damping term proportional to persistent excitation. The optimal control modification
is then analyzed to determine convergence and stability characteristics. The analysis shows that this modification can
achieve fast adaptation without the high-frequency oscillations seen with the standard MRAC. Furthermore, the
modification is shown to provide improved stability robustness while preserving the tracking performance.

II. Optimal Control Modification for Fast Adaptation 



A direct MRAC problem as illustrated in Fig. 1 is posed as follows:

Given a nonlinear plant model as

$$\dot{x} = Ax + B\left[u + f(x)\right] \tag{1}$$

where $x(t) : [0,\infty) \to \mathbb{R}^n$ is a state vector, $u(t) : [0,\infty) \to \mathbb{R}^p$ is a control vector, $A \in \mathbb{R}^{n\times n}$ and $B \in \mathbb{R}^{n\times p}$ are known
plant matrices such that the pair $(A,B)$ is controllable, and $f(x) : \mathbb{R}^n \to \mathbb{R}^p$ is a matched uncertainty that acts as a
disturbance.
Assumption 1: $x(t) \in C^1$ is smooth in $t \in [0,\infty)$.

Assumption 2: $f(x) \in C^1$ is semi-globally Lipschitz. Then there exists a constant $L > 0$ such that

$$\|f(x) - f(x_0)\| \le L\|x - x_0\| \tag{2}$$

for all $\|x\|_\infty \le \epsilon_x$ in $t \in [0,\infty)$.

It then follows that the partial derivatives of $f(x)$ are uniformly bounded and at least piecewise continuous such
that

$$\left\|\frac{\partial f(x)}{\partial x}\right\| \le L \tag{3}$$

for all $\|x\|_\infty \le \epsilon_x$ in $t \in [0,\infty)$.

Proposition 1: If $u(t)$ is a stable and bounded controller, then the total derivative of $f(x)$ is also bounded.

Proof: $u(t)$ is bounded if there exists a constant $\epsilon_u > 0$ such that $\|u\|_\infty \le \epsilon_u\ \forall t \in [0,\infty)$. $u(t)$ is a stable controller,
which implies that $x(t)$ is bounded and so $\|x\|_\infty \le \epsilon_x\ \forall t \in [0,\infty)$. Since $x(t)$ is at least $C^1$ smooth by Assumption 1,
$x(t)$ is bounded, $f(x)$ is semi-globally Lipschitz, and $u(t)$ is bounded, then $\dot{x}(t)$ is also bounded. Thus, there exist
constants $\sigma_{x_i} > 0 \in \mathbb{R}$, $i = 1,\ldots,n$, such that $\sup_t|\dot{x}_i| \le \sigma_{x_i}\ \forall t \in [0,\infty)$. It then follows that

$$\sup_t\left|\frac{df(x)}{dt}\right| \le \left\|\frac{\partial f(x)}{\partial x}\right\|_\infty \mathbf{1}\sum_{i=1}^{n}\sup_t|\dot{x}_i| \le L\,\mathbf{1}\sum_{i=1}^{n}\sigma_{x_i} = \sigma_f \tag{4}$$

for some $\sigma_f > 0 \in \mathbb{R}^p$, where $\mathbf{1} \in \mathbb{R}^p$ is a vector whose elements are all equal to one. Therefore, $\dot{f}(x) \in \mathcal{L}_\infty$.

The objective of the problem is to design a full-state feedback controller that enables the nonlinear plant model to
follow a reference model described by

$$\dot{x}_m = A_m x_m + B_m r \tag{5}$$

where $A_m \in \mathbb{R}^{n\times n}$ is a known Hurwitz matrix, $B_m \in \mathbb{R}^{n\times p}$ is also a known matrix, and $r(t) : [0,\infty) \to \mathbb{R}^p \in \mathcal{L}_\infty$ is
a bounded command vector with its time derivative $\dot{r} \in \mathcal{L}_\infty$ also bounded.

Defining the tracking error as $e = x_m - x$, the goal is then to determine a controller that results in $\lim_{t\to\infty}\|e\| \le \epsilon_e$.
Toward that end, let the controller be comprised of a state feedback, a command feedforward, and an adaptive signal
as follows:

$$u = K_e e + K_m x_m + K_r r - u_{ad} \tag{6}$$

where $K_e \in \mathbb{R}^{p\times n}$, $K_m \in \mathbb{R}^{p\times n}$, and $K_r \in \mathbb{R}^{p\times p}$ are known nominal gain matrices, and $u_{ad} \in \mathbb{R}^p$ is a direct adaptive signal.
Then, the tracking error dynamics become

$$\dot{e} = \dot{x}_m - \dot{x} = (A - BK_e)e + (A_m - A - BK_m)x_m + (B_m - BK_r)r + B\left[u_{ad} - f(x)\right] \tag{7}$$

For bounded tracking error, we choose $A_c = A - BK_e = A_m$, and the gain matrices $K_m$ and $K_r$ to satisfy the model
matching conditions so that the nominal plant tracks the reference model

$$A - BK_e = A_m \tag{8}$$

$$A + BK_m = A_m \tag{9}$$

$$BK_r = B_m \tag{10}$$

The adaptive signal $u_{ad}$ can be parameterized by a linear-in-parameters matched uncertainty

$$u_{ad} = \Theta^\top\Phi(x) \tag{11}$$

where $\Theta \in \mathbb{R}^{m\times p}$ is a weight matrix and $\Phi(x) : \mathbb{R}^n \to \mathbb{R}^m$ is a known regressor vector.

Let $\Theta^*$ be a constant ideal weight matrix and $\tilde\Theta = \Theta - \Theta^*$ be a weight variation, then $\epsilon$ is the approximation error
defined as

$$\epsilon(x) = \Theta^{*\top}\Phi - f(x) \tag{12}$$

Assumption 3: The approximation error $\epsilon(x)$ of the matched uncertainty $f(x)$ by $\Theta^{*\top}\Phi(x)$ is bounded and its time
derivative is also bounded; i.e., there exists a constant vector $\sigma_\epsilon > 0 \in \mathbb{R}^p$ such that

$$\sup_t|\dot\epsilon(x)| = \sup_t\left|\frac{d(\Theta^{*\top}\Phi)}{dt} - \frac{df(x)}{dt}\right| \le \sigma_\epsilon \tag{13}$$
Assumption 3 essentially implies that $\Phi$ is Lipschitz and its partial derivative is bounded, i.e., there exist constants
$C > 0$ and $\eta > 0$ such that

$$\|\Phi(x) - \Phi(x_0)\| \le C\|x - x_0\| \tag{14}$$

$$\left\|\frac{\partial\Phi(x)}{\partial x}\right\| \le \eta \tag{15}$$

for all $\|x\|_\infty \le \epsilon_x$ in $t \in [0,\infty)$.

The tracking error dynamics can now be expressed as

$$\dot{e} = A_c e + B\left(\tilde\Theta^\top\Phi + \epsilon\right) \tag{16}$$

Defining $\delta_\epsilon = \sup_t|\epsilon|$ as an upper bound of $\epsilon$, then

$$\dot{e} \le A_c e + B\left(\tilde\Theta^\top\Phi + \delta_\epsilon\right) \tag{17}$$


An optimal control modification to MRAC for fast adaptation is proposed as follows: 

Proposition 2: The following adaptive law provides a weight update law that minimizes the $\mathcal{L}_2$-norm of the
tracking error:

$$\dot\Theta = -\Gamma\Phi\left(e^\top P - \nu\Phi^\top\Theta B^\top PA_c^{-1}\right)B \tag{18}$$

where $\Gamma > 0 \in \mathbb{R}^{m\times m}$ is a symmetric positive-definite learning rate or adaptive gain matrix, $\nu > 0 \in \mathbb{R}$ is a positive
weighting constant, and $P > 0 \in \mathbb{R}^{n\times n}$ is a symmetric positive-definite matrix that solves the Lyapunov equation

$$PA_c + A_c^\top P = -Q \tag{19}$$

where $Q > 0 \in \mathbb{R}^{n\times n}$ is a symmetric positive-definite matrix.

Proof: The adaptive law seeks a solution that minimizes the $\mathcal{L}_2$-norm of the tracking error with a cost function

$$J = \frac{1}{2}\int_0^{t_f}(e - \Delta)^\top Q(e - \Delta)\,dt \tag{20}$$

subject to Eq. (17), where $\Delta$ represents the tracking error at $t = t_f$.

$J$ is convex and represents the distance measured from the normal surface of a ball $\mathcal{B}_\Delta$ with a radius $\Delta$.



Fig. 2 - Tracking Error Bound 

This is an optimal control problem whose solution can be formulated by Pontryagin's Maximum Principle.
Defining a Hamiltonian

$$H\left(e, \tilde\Theta^\top\Phi\right) = \frac{1}{2}(e - \Delta)^\top Q(e - \Delta) + p^\top\left(A_c e + B\tilde\Theta^\top\Phi + B\delta_\epsilon\right) \tag{21}$$

where $p \in \mathbb{R}^n$ is an adjoint variable, then the adjoint equation is given by the negative gradient of the Hamiltonian with
respect to the tracking error

$$\dot{p} = -\nabla H_e^\top = -Q(e - \Delta) - A_c^\top p \tag{22}$$
Treating $\tilde\Theta^\top\Phi$ as a control variable, the optimality condition is obtained by the gradient of the Hamiltonian with
respect to $\tilde\Theta^\top\Phi$

$$\nabla H_{\tilde\Theta^\top\Phi} = p^\top B \tag{23}$$

The adaptive law can then be formulated by the gradient method as

$$\dot{\tilde\Theta} = -\Gamma\Phi\nabla H_{\tilde\Theta^\top\Phi} = -\Gamma\Phi p^\top B \tag{24}$$

where $\Gamma > 0$ is an adaptive gain or learning rate matrix.

If $e(0)$ is known, then the transversality condition requires

$$p(t_f) = 0 \tag{25}$$

This results in a two-point boundary value problem whereby the adjoint $p$ solves Eqs. (22) and (17) simultaneously.
The optimal control problem can be solved using a "sweeping" method by letting $p = Pe + S\Theta^\top\Phi$. Then

$$\dot{P}e + P\left(A_c e + B\Theta^\top\Phi - B\Theta^{*\top}\Phi + B\delta_\epsilon\right) + \dot{S}\Theta^\top\Phi + S\frac{d(\Theta^\top\Phi)}{dt} \ge -Q(e - \Delta) - A_c^\top\left(Pe + S\Theta^\top\Phi\right) \tag{26}$$

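Since $\Theta^{*\top}\Phi$ is the linear-in-parameters matched uncertainty of $f(x)$, then by Proposition 1 and Assumption 3

$$\begin{aligned}
\sup_t\left|\frac{d(\Theta^{*\top}\Phi)}{dt}\right| &= \sup_t\left|\dot\epsilon(x) + \frac{df(x)}{dt}\right| \le \sigma_\epsilon + \sigma_f \\
\sup_t\left|\frac{d(\Theta^\top\Phi)}{dt}\right| &= \sup_t\left|\dot\Theta^\top\Phi + \Theta^\top\dot\Phi\right| \le \sup_t\left|-B^\top p\,\Phi^\top\Gamma\Phi\right| + \sup_t\left|\Theta^\top\dot\Phi\right|
\end{aligned} \tag{27}$$

The first term in the last inequality of Eq. (27) is bounded since $p$ must be a stable solution to the optimal control
problem and $\Phi$ is also bounded by definition. The second term is also bounded since $\Theta$ must be bounded if the adaptive
law is stable (an assertion that will be proved later) and $\dot\Phi$ is bounded by virtue of Assumption 3. Therefore, there
exists a constant vector $\sigma_t > 0 \in \mathbb{R}^p$ such that

$$\sup_t\left|\frac{d(\Theta^\top\Phi)}{dt}\right| \le \sigma_t \tag{28}$$

Equation (26) yields three equations

$$\dot{P} + PA_c + A_c^\top P + Q = 0 \tag{29}$$

$$\dot{S} + PB + A_c^\top S = 0 \tag{30}$$

subject to $P(t_f) = 0$ and $S(t_f) = 0$, and

$$\Delta \le Q^{-1}\left[PB\left(\delta_\epsilon - \Theta^{*\top}\Phi\right) + S\sigma_t\right] \tag{31}$$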


Consider an infinite time-horizon optimal control problem by letting $t_f \to \infty$, then $P(t) \to P(0)$ and $S(t) \to S(0)$
and the solutions of $P$ and $S$ are determined by their steady state values. Thus

$$PA_c + A_c^\top P = -Q \tag{32}$$

$$S = -A_c^{-\top}PB \tag{33}$$

The adjoint $p$ now becomes

$$p = Pe - A_c^{-\top}PB\Theta^\top\Phi \tag{34}$$

Since $\Theta^*$ is constant, then $\dot{\tilde\Theta} = \dot\Theta$. Upon substituting the expression of the adjoint $p$ into Eq. (24), the following
adaptive law is then obtained

$$\dot\Theta = -\Gamma\Phi\left(e^\top P - \nu\Phi^\top\Theta B^\top PA_c^{-1}\right)B \tag{35}$$

where $\nu > 0 \in \mathbb{R}$ is introduced as a weighting constant to allow for adjustments of the second term in the adaptive law.


□ 
Remark 1: The cost function (20) could also be penalized with a weighted norm of $\Theta^\top\Phi$ with weight $R > 0$. This would result in an additional
term in the adaptive law which would then become

$$\dot\Theta = -\Gamma\Phi\left(e^\top PB - \nu\Phi^\top\Theta B^\top PA_c^{-1}B + \Phi^\top\Theta R\right) \tag{36}$$

$R$ then becomes an additional tuning parameter that can be used to adjust the adaptive law. Alternatively, the
adaptive law could include only the $R$ term as

$$\dot\Theta = -\Gamma\Phi\left(e^\top PB + \Phi^\top\Theta R\right) \tag{37}$$

The adaptive law (18) can be shown to be stable by the following theorem:

Theorem 1: The adaptive law (18) results in stable and uniformly bounded tracking error in a compact set

$$\|e\| \le \frac{2\lambda_{\max}(P)\|B\|\left(\|\Theta^{*\top}\Phi\|_\infty + \|\delta_\epsilon\|\right)}{\lambda_{\min}(Q)} \tag{38}$$

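Proof: Choose a Lyapunov candidate function

$$V = e^\top Pe + \mathrm{trace}\left(\Theta^\top\Gamma^{-1}\Theta\right) \tag{39}$$

where $P$ solves Eq. (19).

Evaluating the Lie derivative of $V$ yields

$$\dot{V} \le e^\top\left(A_c^\top P + PA_c\right)e + 2e^\top PB\left(\Theta^\top\Phi - \Theta^{*\top}\Phi + \delta_\epsilon\right) - 2\,\mathrm{trace}\left[\Theta^\top\Phi\left(e^\top PB - \nu\Phi^\top\Theta B^\top PA_c^{-1}B\right)\right] \tag{40}$$

Using the trace identity $\mathrm{trace}(A^\top B) = BA^\top$, $\dot{V}$ can be written as

$$\dot{V} \le -e^\top Qe + 2e^\top PB\left(\Theta^\top\Phi - \Theta^{*\top}\Phi + \delta_\epsilon\right) - 2e^\top PB\Theta^\top\Phi + 2\nu\Phi^\top\Theta B^\top PA_c^{-1}B\Theta^\top\Phi \tag{41}$$

The sign-definiteness of the term $PA_c^{-1}$ is now evaluated. We recall that a general real matrix $M$ is positive
(negative) definite if and only if its symmetric part $M_s = \frac{1}{2}(M + M^\top)$ is also positive (negative) definite. Then, by pre- and
post-multiplication of Eq. (19) by $A_c^{-\top}$ and $A_c^{-1}$, respectively, one gets

$$A_c^{-\top}P + PA_c^{-1} = -A_c^{-\top}QA_c^{-1} \tag{42}$$

Since $A_c^{-\top}QA_c^{-1} > 0$, we conclude that $PA_c^{-1} < 0$. Furthermore, $PA_c^{-1}$ can be decomposed into a symmetric part
$M = \frac{1}{2}\left(PA_c^{-1} + A_c^{-\top}P\right) = -\frac{1}{2}A_c^{-\top}QA_c^{-1} < 0$ and an anti-symmetric part $N = \frac{1}{2}\left(PA_c^{-1} - A_c^{-\top}P\right)$. Then, $\dot{V}$ becomes

$$\dot{V} \le -e^\top Qe + 2e^\top PB\left(-\Theta^{*\top}\Phi + \delta_\epsilon\right) - \nu\Phi^\top\Theta B^\top A_c^{-\top}QA_c^{-1}B\Theta^\top\Phi + 2\nu\Phi^\top\Theta B^\top NB\Theta^\top\Phi \tag{43}$$

Letting $y = B\Theta^\top\Phi$ and using the property $y^\top Ny = 0$ for an anti-symmetric matrix $N$, $\dot{V}$ is reduced to

$$\dot{V} \le -e^\top Qe + 2e^\top PB\left(-\Theta^{*\top}\Phi + \delta_\epsilon\right) - \nu\Phi^\top\Theta B^\top A_c^{-\top}QA_c^{-1}B\Theta^\top\Phi \tag{44}$$

and is bounded by

$$\dot{V} \le -\lambda_{\min}(Q)\|e\|^2 + 2\lambda_{\max}(P)\|B\|\left(\|\Theta^{*\top}\Phi\| + \|\delta_\epsilon\|\right)\|e\| - \nu\lambda_{\min}(Q)\left\|A_c^{-1}B\Theta^\top\Phi\right\|^2 \tag{45}$$

where $\|\Theta^{*\top}\Phi\| = \left\|\sup_t|\Theta^{*\top}\Phi|\right\|$.

To ensure that $\dot{V} \le 0$, we require that

$$-\lambda_{\min}(Q)\|e\|^2 + 2\lambda_{\max}(P)\|B\|\left(\|\Theta^{*\top}\Phi\| + \|\delta_\epsilon\|\right)\|e\| \le 0 \tag{46}$$

which implies

$$\|e\| \ge \frac{2\lambda_{\max}(P)\|B\|\left(\|\Theta^{*\top}\Phi\| + \|\delta_\epsilon\|\right)}{\lambda_{\min}(Q)} \tag{47}$$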
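
As a sanity check on the threshold in Eq. (47), the quantities above can be evaluated by hand for a scalar error model. The scalar values $A_c = -a$ with $a > 0$, $B = b$, $Q = q > 0$, and the scalar regressor $\phi$ and weights $\theta$, $\theta^*$ are assumptions introduced only for this illustration and do not appear in the paper.

```latex
\begin{aligned}
PA_c + A_c P &= -q \;\Rightarrow\; P = \frac{q}{2a} > 0,
\qquad PA_c^{-1} = -\frac{q}{2a^{2}} < 0, \\[4pt]
\dot V &\le -q\,e^{2} + \frac{q\,|b|}{a}\left(|\theta^{*}\phi| + \delta_\epsilon\right)|e|
          - \nu\,\frac{q\,b^{2}}{a^{2}}\,(\theta\phi)^{2}, \\[4pt]
\dot V &\le 0 \quad\text{whenever}\quad
|e| \ge \frac{|b|\left(|\theta^{*}\phi| + \delta_\epsilon\right)}{a}.
\end{aligned}
```

In this scalar sketch the threshold no longer depends on $q$, and the $\nu$ term is always non-positive, which is consistent with the sign argument for $PA_c^{-1}$ given above.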
It is noted that $e \in \mathcal{L}_\infty$, but $e \in \mathcal{L}_2 \cap \mathcal{L}_\infty$ since


A mi „ (Q) J \\e\\ 2 dt < V (0) -V (t - oo) +2A ma * (P) ||S|| ( 


©* t 4> 


Hell* 


It follows that 



ax 1 b® t <p 

Jo 



V ^min (£ 2 ) 


* < ' 


poo 

a-'b® 7 ® 

Jo 



y ( t -+ oo) < y (0) - vA m ,„ (g) 
y (t) thus decreases inside a compact set 5P C M" where 

2A ma . v (P) ||B|| (||©* T< 1>|| + ||5, 


dt < oo (48) 


(49) 


=5^ = < e € M" : ||e|| > r = 


^ min (Q) 


(50) 


but $V(t)$ increases inside the complementary set $\mathcal{S}^c = \{e \in \mathbb{R}^n : \|e\| < r\}$, which contains $e = 0$, whose trajectories
will all stay inside of $\mathcal{S}^c$. It follows by LaSalle's extensions of the Lyapunov method that $e$ is uniformly bounded and
so is $\Theta$.


□ 

Remark 2: The effect of the added term in the present adaptive law is to add damping to the weight update law so
as to reduce high-frequency oscillations in the weights. The damping term requires persistent excitation (PE), which
is defined by the product term $\Phi\Phi^\top$. With persistent excitation, the weight $\Theta$ is exponentially stable and bounded.
This scheme is contrasted with the well-known $\sigma$- [17] and $e$- [16] modification methods and other variants, which also add
damping terms to prevent parameter drift in the absence of persistent excitation [16]. These adaptive laws are compared
as follows:

Modification    Adaptive Law
------------    ------------
$\sigma$-       $\dot\Theta = -\Gamma\left(\Phi e^\top PB + \sigma\Theta\right)$, $\sigma > 0$
$e$-            $\dot\Theta = -\Gamma\left(\Phi e^\top PB + \mu\left\|e^\top PB\right\|\Theta\right)$, $\mu > 0$
Optimal         $\dot\Theta = -\Gamma\left(\Phi e^\top PB - \nu\Phi\Phi^\top\Theta B^\top PA_c^{-1}B\right)$, $\nu > 0$

Table 1 - Modifications to MRAC Law


□ 
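
To make the comparison in Table 1 concrete, the three update laws can be written out for a single scalar weight. This is only a sketch under stated assumptions: the function names, the scalar reduction ($P$, $b$, $a_c$ as plain numbers), and the test values are hypothetical and are not taken from the paper.

```python
def sigma_mod_rate(theta, phi, e, gamma, P, b, sigma):
    """Scalar sigma-modification: damping sigma*theta is always active."""
    return -gamma * (phi * e * P * b + sigma * theta)


def e_mod_rate(theta, phi, e, gamma, P, b, mu):
    """Scalar e-modification: damping scales with |e*P*b|, so it vanishes when e = 0."""
    return -gamma * (phi * e * P * b + mu * abs(e * P * b) * theta)


def optimal_mod_rate(theta, phi, e, gamma, P, b, a_c, nu):
    """Scalar optimal control modification: damping scales with phi**2,
    so it vanishes when the regressor is zero (no excitation).
    a_c < 0 is the scalar closed-loop pole, so b*P/a_c*b is negative."""
    return -gamma * (phi * e * P * b - nu * phi * phi * theta * b * P / a_c * b)


if __name__ == "__main__":
    # Hypothetical numbers: zero tracking error, nonzero weight and regressor.
    print(sigma_mod_rate(2.0, 0.5, 0.0, 10.0, 1.0, 1.0, 0.1))
    print(e_mod_rate(2.0, 0.5, 0.0, 10.0, 1.0, 1.0, 0.1))
    print(optimal_mod_rate(2.0, 0.5, 0.0, 10.0, 1.0, 1.0, -1.0, 0.1))
```

With zero tracking error, the $e$-modification adds no damping, whereas the optimal control modification still pulls the weight toward zero as long as the regressor is nonzero, which is the PE dependence noted in Remark 2.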


In the presence of fast adaptation, i.e., $\lambda_{\min}(\Gamma) \gg 1$, the adaptive law (18) is robustly stable with all closed-loop
poles having negative real values if $\nu = 1$. This fact can be established in the following theorem:

Theorem 2: For large adaptive gain $\Gamma$ and $\Phi^\top\Phi > 0$, which implies PE, the adaptive law (18) results in robustly
stable closed-loop tracking error dynamics

$$\dot{e} \le -P^{-1}Qe + B\left(\sup_t\left|\Theta^{*\top}\Phi\right| + \delta_\epsilon\right) \tag{51}$$

when $\nu = 1$.

Proof: The adaptive law (18) can be written as

$$\Gamma^{-1}\dot\Theta = -\Phi\left(e^\top P - \nu\Phi^\top\Theta B^\top PA_c^{-1}\right)B \tag{52}$$

If $\Gamma$ is large, then in the limit as $\Gamma \to \infty$

$$e^\top P - \nu\Phi^\top\Theta B^\top PA_c^{-1} = 0 \tag{53}$$

Solving for $B\Theta^\top\Phi$ yields

$$B\Theta^\top\Phi = \frac{1}{\nu}P^{-1}A_c^\top Pe \tag{54}$$
Hence, the closed-loop tracking dynamics become

$$\dot{e} \le \left(A_c + \frac{1}{\nu}P^{-1}A_c^\top P\right)e + B\left(\sup_t\left|\Theta^{*\top}\Phi\right| + \delta_\epsilon\right) \tag{55}$$

which, upon some algebra, can also be written as

$$\dot{e} \le -P^{-1}\left[Q - \left(\frac{1}{\nu} - 1\right)A_c^\top P\right]e + B\left(\sup_t\left|\Theta^{*\top}\Phi\right| + \delta_\epsilon\right) \tag{56}$$

$A_c^\top P$ can be decomposed into a symmetric part $\frac{1}{2}\left(A_c^\top P + PA_c\right) = -\frac{1}{2}Q$ and an anti-symmetric part $S = \frac{1}{2}\left(A_c^\top P - PA_c\right)$.
The tracking error dynamics now become

$$\dot{e} \le -P^{-1}\left[\frac{1+\nu}{2\nu}Q - \frac{1-\nu}{\nu}S\right]e + B\left(\sup_t\left|\Theta^{*\top}\Phi\right| + \delta_\epsilon\right) \tag{57}$$

The eigenvalues of $Q$ are all real positive values and those of $S$ are purely imaginary. The system is stable for
all values of $\nu$. If $\nu < 1$, the closed-loop complex-conjugate poles move further into the left-half plane and $\mathrm{Im}[s]$
increases with decreasing $\nu$. In the limit, when $\nu \to 0$ and the adaptive law reverts to the standard MRAC law,
then $\mathrm{Im}[s] \to \infty$, which illustrates the well-known fact that fast adaptation with the standard MRAC law results in
high-frequency signals which can potentially lead to instability in the presence of time delay or unmodeled dynamics.
Conversely, if $\nu$ becomes large, the effect of adaptation is reduced and in the limit when $\nu \to \infty$, adaptation ceases as
the adaptive law (18) becomes infinitely stiff.

A special case of $\nu = 1$ is considered. The closed-loop poles are all real, negative values with $\mathrm{Re}[s] = -\lambda\left(P^{-1}Q\right)$.
The system transfer function matrix $H(s) = \left(sI + P^{-1}Q\right)^{-1}$ is strictly positive real (SPR) since $H(j\omega) + H^\top(-j\omega) > 0$,
and thus the system is minimum phase and dissipative [18]. The Nyquist plot of a strictly stable transfer function is
strictly in the right half plane with a phase shift less than or equal to $\frac{\pi}{2}$ [18].

□ 
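
The limiting dynamics in Theorem 2 can be illustrated by reducing Eqs. (55) and (57) to a scalar error model. The scalar values $A_c = -a$ with $a > 0$, $B = b$, and $Q = q > 0$ are assumptions made only for this worked example; in the scalar case the anti-symmetric part vanishes, so only the real part of the pole is visible.

```latex
\begin{aligned}
P &= \frac{q}{2a}, \qquad P^{-1}A_c P = -a, \qquad P^{-1}Q = 2a, \\[4pt]
\dot e &\le -a\left(1 + \frac{1}{\nu}\right)e
        + b\left(\sup_t|\theta^{*}\phi| + \delta_\epsilon\right), \\[4pt]
s &= -a\,\frac{1+\nu}{\nu}
  \;\;\Rightarrow\;\;
  s\big|_{\nu = 1} = -2a = -P^{-1}Q, \qquad
  s \to -a \ \text{as}\ \nu \to \infty .
\end{aligned}
```

Under these assumptions, $\nu = 1$ recovers the pole of Eq. (51), and a very large $\nu$ leaves only the nominal pole $-a$, consistent with the statement that adaptation ceases in that limit.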

Remark 3: The adaptive law for fast adaptation results in an LTI representation of the tracking error dynamics in
the limit when $\Gamma \to \infty$. This is a useful feature of this adaptive law that can enable the stability of the system to be
analyzed using traditional linear control methods.

The adaptive law (18) causes the tracking error to tend to zero as $\nu \to 0$ if $\gamma \to \infty$ for fast adaptation. On the other
hand, stability robustness requires $\nu > 0$. Thus, a trade-off between tracking performance and stability robustness
exists and, consequently, $\nu$ becomes a design parameter to be chosen to satisfy control requirements if $\Gamma$ is large and
the input is PE. This can be shown as follows:

Lemma 1: The equilibrium state $y = 0$ of the differential equation

$$\dot{y} = -\Phi^\top(t)\Gamma\Phi(t)\,y \tag{58}$$

where $y(t) : [0,\infty) \to \mathbb{R}$, $\Phi(t) \in \mathcal{L}_2 : [0,\infty) \to \mathbb{R}^n$ is a piecewise-continuous and bounded function, and $\Gamma > 0 \in \mathbb{R}^{n\times n}$,
is uniformly asymptotically stable if there exists a constant $\gamma > 0$ such that

$$\frac{1}{T_0}\int_t^{t+T_0}\Phi^\top(\tau)\Gamma\Phi(\tau)\,d\tau \ge \gamma \tag{59}$$

which implies that $y$ is bounded by a linear differential equation

$$\dot{y} \le -\gamma y \tag{60}$$

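Proof: Choose a Lyapunov candidate function

$$V = \frac{1}{2}y^2 \tag{61}$$

$$\dot{V} = -\Phi^\top(t)\Gamma\Phi(t)\,y^2 = -2\Phi^\top(t)\Gamma\Phi(t)\,V \tag{62}$$

Then, there exists $\gamma > 0$ for which $V$ is uniformly asymptotically stable since

$$V(t + T_0) = V(t)\exp\left(-2\int_t^{t+T_0}\Phi^\top(\tau)\Gamma\Phi(\tau)\,d\tau\right) \le V(t)e^{-2\gamma T_0} \tag{63}$$

This implies that

$$\exp\left(-2\int_t^{t+T_0}\Phi^\top(\tau)\Gamma\Phi(\tau)\,d\tau\right) \le e^{-2\gamma T_0} \tag{64}$$

Thus, the equilibrium $y = 0$ is uniformly asymptotically stable if

$$\frac{1}{T_0}\int_t^{t+T_0}\Phi^\top(\tau)\Gamma\Phi(\tau)\,d\tau \ge \gamma \tag{65}$$

provided $\Phi(t) \in \mathcal{L}_2$ is bounded.

Then $y(t) \in \mathcal{L}_2 \cap \mathcal{L}_\infty$ since

$$V(t \to \infty) - V(0) \le -2\gamma\int_0^\infty V(t)\,dt \;\Rightarrow\; \gamma\int_0^\infty y^2(t)\,dt \le V(0) - V(t \to \infty) < \infty \tag{66}$$

It follows that

$$y(t + T_0) = y(t)\exp\left(-\int_t^{t+T_0}\Phi^\top(\tau)\Gamma\Phi(\tau)\,d\tau\right) \le y(t)e^{-\gamma T_0}$$

which is equivalent to

$$\dot{y} = -\Phi^\top(t)\Gamma\Phi(t)\,y \le -\gamma y \tag{67}$$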

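Now, suppose that $\Phi = \Phi(y(t))$; Eq. (67) still applies. The condition $\Phi(y(t)) \in \mathcal{L}_2$ is satisfied since $y \in \mathcal{L}_2 \cap \mathcal{L}_\infty$.
To show this, we first evaluate $\dot{V}$ as

$$\dot{V} = -\Phi^\top(y(t))\Gamma\Phi(y(t))\,y^2 \tag{68}$$

which upon integration yields

$$V(t + T_0) = V(t)\exp\left(-2\int_t^{t+T_0}\Phi^\top(y(\tau))\Gamma\Phi(y(\tau))\,d\tau\right) \le V(t)e^{-2\gamma T_0} \tag{69}$$

Thus, $V$ is uniformly asymptotically stable. This condition is also equivalent to

$$\dot{V} \le -2\gamma V \;\Rightarrow\; y\dot{y} \le -\gamma y^2 \tag{70}$$

which then implies Eq. (67).

Example: Consider $\phi(t) = (t + 1)^{-1} \in \mathcal{L}_2$. $\gamma$ is evaluated as

$$\gamma = \frac{1}{T_0}\int_0^{T_0}\frac{d\tau}{(\tau + 1)^2} = \frac{1}{T_0 + 1} \tag{71}$$

The solutions of $\dot{y} = -y\phi^2(t)$ and $\dot{y}_1 = -\gamma y_1$ with $y(0) = y_1(0)$ are

$$y(t) = y(0)\exp\left(-\frac{t}{t + 1}\right) \tag{72}$$

$$y_1(t) = y(0)\exp\left(-\frac{t}{T_0 + 1}\right) \tag{73}$$

Clearly, $\frac{t}{t+1} \ge \frac{t}{T_0+1}$ since $T_0 \ge t$. Therefore, $y \le y_1$.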
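
The example above can also be checked numerically. The following sketch evaluates the windowed average that defines $\gamma$ by a midpoint rule and compares the two closed-form solutions on $0 \le t \le T_0$. The function names and the choice $T_0 = 4$ are assumptions made for this illustration, and a unit adaptive gain is assumed as in the example.

```python
import math


def phi(t):
    """Regressor of the example: phi(t) = 1 / (t + 1)."""
    return 1.0 / (t + 1.0)


def gamma_from_window(T0, n=20000):
    """Midpoint-rule estimate of (1/T0) * integral_0^T0 phi(tau)^2 d tau."""
    h = T0 / n
    total = sum(phi((k + 0.5) * h) ** 2 for k in range(n))
    return total * h / T0


def y_exact(t, y0=1.0):
    """Solution of y' = -phi(t)^2 * y."""
    return y0 * math.exp(-t / (t + 1.0))


def y_comparison(t, T0, y0=1.0):
    """Solution of the linear comparison system y1' = -gamma * y1, gamma = 1/(T0 + 1)."""
    return y0 * math.exp(-t / (T0 + 1.0))


if __name__ == "__main__":
    T0 = 4.0  # hypothetical window length
    print("gamma (numeric):", gamma_from_window(T0))
    print("gamma (closed form):", 1.0 / (T0 + 1.0))
    for k in range(9):
        t = 0.5 * k
        print(t, y_exact(t), y_comparison(t, T0))
```

On the window $0 \le t \le T_0$ the exact solution stays at or below the linear comparison solution, and the two coincide at $t = 0$ and $t = T_0$.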

Lemma 1 is a version of the Comparison Lemma that allows bounds on the solution $y(t)$ to be computed from a
differential inequality without the need to compute the solution itself [19]. A different version of the proof is also provided
by Narendra and Annaswamy [20]. Figure 3 illustrates various functions as compared to their linear counterparts.

Fig. 3 - Comparison of Solutions of Differential Equations


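Theorem 3: The steady state tracking error is bounded by

$$\limsup_{t\to\infty}\|e\| \le \frac{\nu\lambda_{\max}(P)\|B\|\left(\|\Theta^{*\top}\Phi\|_\infty + \|\delta_\epsilon\|\right)}{\sigma_{\min}\left(A_c^\top P + \nu PA_c\right)} + \frac{\|A_c\|\|B\|\left(\|\beta\| + \|\sigma_\epsilon\|\right)}{\gamma\,\sigma_{\min}\left(BB^\top\right)\sigma_{\min}\left(A_c^\top P + \nu PA_c\right)} \tag{74}$$

if there exist a constant $\gamma > 0$ such that $\gamma = \inf_t\frac{1}{T_0}\int_t^{t+T_0}\Phi^\top\Gamma\Phi\,d\tau > 0 \in \mathbb{R}$ and a constant vector $\beta > 0 \in \mathbb{R}^p$,
$\beta = \sup_t\left|\Theta^\top\dot\Phi\right|$.

Proof: Since $e \in \mathcal{L}_2$, $x \in \mathcal{L}_2$ and so $\Phi(x) \in \mathcal{L}_2$. Using Lemma 1, the adaptive law (18) can be written as

$$\frac{d}{dt}\left(\Theta^\top\Phi\right) = \dot\Theta^\top\Phi + \Theta^\top\dot\Phi \le -\gamma\left(B^\top Pe - \nu B^\top A_c^{-\top}PB\Theta^\top\Phi\right) + \beta \tag{75}$$

where $\beta = \sup_t\left|\Theta^\top\dot\Phi\right| \in \mathcal{L}_\infty$ is bounded since $\dot\Phi \in \mathcal{L}_\infty$ is bounded by Assumption 3 because

$$\Theta^{*\top}\sup_t\left|\dot\Phi\right| = \sup_t\left|\dot\epsilon(x) + \frac{df(x)}{dt}\right| \le \sup_t\left|\dot\epsilon(x)\right| + \sup_t\left|\frac{df(x)}{dt}\right| \le \sigma_\epsilon + \sigma_f \tag{76}$$

Thus, the system dynamics with adaptation are described by

$$\frac{d}{dt}\begin{bmatrix} e \\ \tilde\Theta^\top\Phi \end{bmatrix} \le \begin{bmatrix} A_c & B \\ -\gamma B^\top P & \gamma\nu B^\top A_c^{-\top}PB \end{bmatrix}\begin{bmatrix} e \\ \tilde\Theta^\top\Phi \end{bmatrix} + \begin{bmatrix} B\delta_\epsilon \\ \gamma\nu B^\top A_c^{-\top}PB\Theta^{*\top}\Phi + \beta \end{bmatrix} \tag{77}$$

Differentiating the tracking error dynamics and upon substitution yields

$$\ddot{e} - A_c\dot{e} + \gamma BB^\top Pe - \gamma\nu BB^\top A_c^{-\top}PB\tilde\Theta^\top\Phi \le B\left(\gamma\nu B^\top A_c^{-\top}PB\Theta^{*\top}\Phi + \beta + \sigma_\epsilon\right) \tag{78}$$

The steady state tracking error is thus bounded by

$$\left(\gamma BB^\top P + \gamma\nu BB^\top A_c^{-\top}PA_c\right)e \le \gamma\nu BB^\top A_c^{-\top}PB\left(\Theta^{*\top}\Phi - \delta_\epsilon\right) + B\left(\beta + \sigma_\epsilon\right) \tag{79}$$

from which the upper bound on the norm of the steady-state tracking error is computed as

$$\limsup_{t\to\infty}\|e\| \le \frac{\nu\lambda_{\max}(P)\|B\|\left(\|\Theta^{*\top}\Phi\|_\infty + \|\delta_\epsilon\|\right)}{\sigma_{\min}\left(A_c^\top P + \nu PA_c\right)} + \frac{\|A_c\|\|B\|\left(\|\beta\| + \|\sigma_\epsilon\|\right)}{\gamma\,\sigma_{\min}\left(BB^\top\right)\sigma_{\min}\left(A_c^\top P + \nu PA_c\right)} \tag{80}$$

where $\sigma_{\min}$ denotes the minimum singular value.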

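Similarly, the steady state value of $\tilde\Theta^\top\Phi$ and the upper bound of its norm are obtained as

$$-\left(\gamma\nu B^\top A_c^{-\top}P + \gamma B^\top PA_c^{-1}\right)B\tilde\Theta^\top\Phi \le \left(\gamma\nu B^\top A_c^{-\top}PB\Theta^{*\top}\Phi + \beta + \sigma_\epsilon\right) + \gamma B^\top PA_c^{-1}B\delta_\epsilon \tag{81}$$

$$\limsup_{t\to\infty}\left\|\tilde\Theta^\top\Phi\right\| \le \frac{\nu\lambda_{\max}(P)\|A_c\|\left\|\Theta^{*\top}\Phi\right\|}{\sigma_{\min}\left(A_c^\top P + \nu PA_c\right)} + \frac{\lambda_{\max}(P)\|A_c\|^2\|\delta_\epsilon\|}{\sigma_{\min}(A_c)\,\sigma_{\min}\left(A_c^\top P + \nu PA_c\right)} + \frac{\|A_c\|^2\left(\|\beta\| + \|\sigma_\epsilon\|\right)}{\gamma\,\sigma_{\min}\left(BB^\top\right)\sigma_{\min}\left(A_c^\top P + \nu PA_c\right)} \tag{82}$$

Thus for fast adaptation with PE, i.e., $\gamma \to \infty$, the second term on the RHS of Eq. (80) goes to zero, and the tracking
error is bounded by a value that depends only on $\nu$. If, in addition, $\nu \to 0$, then $e \to 0$, but if $\nu \to \infty$, $e \in \mathcal{L}_\infty$ is finite
and does not tend to zero. Thus, $\nu$ has to be selected small enough to provide a desired tracking performance, but large
enough to provide sufficient stability margins against time delay or unmodeled dynamics.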


□ 
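
The trade-off governed by $\nu$ can be seen in a minimal simulation of a scalar loop. This is a sketch under stated assumptions rather than the paper's simulation: the plant, the values $a_m = -1$, $b = 1$, $\theta^* = 1$, $\Gamma = 100$, the unit step command, and the forward-Euler integration are all hypothetical choices. The error dynamics follow Eq. (16) with $\epsilon = 0$ and the weight update is the scalar form of Eq. (18).

```python
def simulate(nu, gamma=100.0, theta_star=1.0, a_m=-1.0, b=1.0, r=1.0,
             dt=1e-3, t_final=20.0):
    """Forward-Euler simulation of a hypothetical scalar MRAC loop.

    Returns the final tracking error e = x_m - x and the final weight theta.
    """
    q = 2.0
    P = q / (-2.0 * a_m)          # scalar Lyapunov equation: 2 * P * a_m = -q
    b_m = -a_m                    # unit DC gain reference model
    x_m = 0.0
    x = 0.0
    theta = 0.0
    for _ in range(int(round(t_final / dt))):
        e = x_m - x
        phi = x                   # scalar regressor, uncertainty f(x) = theta_star * x
        # Scalar form of the adaptive law (18); a_m plays the role of A_c.
        theta_dot = -gamma * phi * (e * P - nu * phi * theta * b * P / a_m) * b
        x_m_dot = a_m * x_m + b_m * r
        # Nominal control already applied, so that e' = a_m * e + b * (u_ad - f).
        x_dot = a_m * x + b_m * r + b * (theta_star * phi - theta * phi)
        x_m += dt * x_m_dot
        x += dt * x_dot
        theta += dt * theta_dot
    return x_m - x, theta


if __name__ == "__main__":
    for nu in (0.0, 0.1):
        e_final, theta_final = simulate(nu)
        print("nu =", nu, "final error =", e_final, "final weight =", theta_final)
```

For this scalar setup the equilibrium of the modified law satisfies $\theta = \theta^*/(1+\nu)$ and $e = -\nu\theta^* x/(1+\nu)$, so a nonzero $\nu$ leaves a steady-state error whose magnitude matches the first term of Eq. (80), while $\nu = 0$ drives the error toward zero.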


III. Application to Flight Control 


Consider the adaptive flight control architecture shown in Fig. 4. The control architecture comprises:
1) a reference model that translates rate commands into desired acceleration commands, 2) a proportional-integral
(PI) feedback control for rate stabilization and tracking, 3) a dynamic inversion controller that computes actuator
commands using desired acceleration commands, and 4) a neural net direct MRAC due to Rysdyk and Calise [8].



Fig. 4 - Direct Neural Network Adaptive Flight Control 


Adaptive flight control can be used to provide consistent handling qualities and restore stability of aircraft under
off-nominal operating conditions such as those due to failures or damage. A reduced-order equation of the linearized
angular motion of a damaged aircraft can be described by

$$\dot{x} = Ax + Bu + Gz + f(x,u,z) \tag{83}$$

where $x = \begin{bmatrix} p & q & r \end{bmatrix}^\top$ is the angular rate vector; $u = \begin{bmatrix} \delta_a & \delta_e & \delta_r \end{bmatrix}^\top$ is the control surface deflection vector;
$z = \begin{bmatrix} \alpha & \beta & \delta_T \end{bmatrix}^\top$ is a trim state vector; $A \in \mathbb{R}^{3\times3}$, $B \in \mathbb{R}^{3\times3}$, and $G \in \mathbb{R}^{3\times3}$ are known; and $f(x,u,z)$ represents a
structured uncertainty which has the form

$$f(x,u,z) = \Delta Ax + \Delta Bu + \Delta Gz \tag{84}$$

where $\Delta A$, $\Delta B$, and $\Delta G$ are changes to the $A$, $B$, and $G$ matrices of the aircraft linear plant model.

The objective is to design a dynamic inversion flight control law with a direct adaptive control augmentation to
provide consistent handling qualities which may be specified by a reference model according to

$$\dot{x}_m = A_m x_m + B_m r \tag{85}$$

where $A_m \in \mathbb{R}^{3\times3}$ is Hurwitz, $B_m \in \mathbb{R}^{3\times3}$ is known, and $r \in \mathcal{L}_\infty$ is a bounded pilot command with its time derivative $\dot{r} \in \mathcal{L}_\infty$
also bounded.
Let $\dot{x}_d$ be a desired acceleration that comprises the reference model acceleration, a proportional-integral feedback
control, and a neural net adaptive signal

$$\dot{x}_d = A_m x_m + B_m r + K_p(x_m - x) + K_i\int_0^t(x_m - x)\,d\tau - u_{ad} \tag{86}$$

where $u_{ad} = \Theta^\top\Phi$ with $\Phi = \begin{bmatrix} x^\top & u^\top & z^\top \end{bmatrix}^\top$.

Assuming that $B$ is invertible, then the dynamic inversion controller is computed as

$$u = B^{-1}\left(\dot{x}_d - Ax - Gz\right) \tag{87}$$

Computing the acceleration error yields

$$\dot{\tilde{x}} = -K_p\tilde{x} - K_i\int_0^t\tilde{x}\,d\tau + u_{ad} - f(x,u,z) \tag{88}$$

where $\tilde{x} = x_m - x$, $K_p = \mathrm{diag}(k_{p1}, k_{p2}, k_{p3}) > 0$, and $K_i = \mathrm{diag}(k_{i1}, k_{i2}, k_{i3}) > 0$ are matrices of the proportional and
integral gains for roll, pitch, and yaw.

Defining the tracking error as

$$e = \begin{bmatrix} \int_0^t\tilde{x}\,d\tau \\ \tilde{x} \end{bmatrix} \tag{89}$$

then the tracking error dynamics are expressed by

$$\dot{e} = A_c e + b\left(u_{ad} - f\right) \tag{90}$$

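where

$$A_c = \begin{bmatrix} 0 & I \\ -K_i & -K_p \end{bmatrix} \tag{91}$$

$$b = \begin{bmatrix} 0 \\ I \end{bmatrix} \tag{92}$$

Let $Q = 2I$, then the solution of Eq. (19) yields

$$P = \begin{bmatrix} K_i^{-1}K_p + K_p^{-1}(K_i + I) & K_i^{-1} \\ K_i^{-1} & K_p^{-1}\left(I + K_i^{-1}\right) \end{bmatrix} > 0 \tag{93}$$

$A_c^{-1}$ is computed to be

$$A_c^{-1} = \begin{bmatrix} -K_i^{-1}K_p & -K_i^{-1} \\ I & 0 \end{bmatrix} \tag{94}$$

Evaluating the term $b^\top PA_c^{-1}b$ yields

$$b^\top PA_c^{-1}b = -K_i^{-2} < 0 \tag{95}$$

Applying the adaptive law (18), the weight update law is then given by

$$\dot\Theta = -\Gamma\Phi\left(e^\top Pb + \nu\Phi^\top\Theta K_i^{-2}\right) \tag{96}$$

Thus, the damping term in the adaptive law only depends on the integral gain $K_i$.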
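
Because $K_p$ and $K_i$ are diagonal, the expressions for $P$, $A_c^{-1}$, and $b^\top PA_c^{-1}b$ can be checked one axis at a time with $2\times2$ matrices. The sketch below does this with plain nested lists; the helper names and the gain values $k_p = 2$, $k_i = 4$ are assumptions chosen only for the check.

```python
def matmul(A, B):
    """Product of two matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def transpose(A):
    return [list(row) for row in zip(*A)]


def axis_matrices(kp, ki):
    """Single-axis A_c, P (for Q = 2I), and A_c^{-1} from the closed-form expressions."""
    A_c = [[0.0, 1.0], [-ki, -kp]]
    P = [[kp / ki + (ki + 1.0) / kp, 1.0 / ki],
         [1.0 / ki, (1.0 + 1.0 / ki) / kp]]
    A_c_inv = [[-kp / ki, -1.0 / ki], [1.0, 0.0]]
    return A_c, P, A_c_inv


def lyapunov_residual(A_c, P):
    """Entries of P*A_c + A_c^T*P + 2I, which should all be zero."""
    PA = matmul(P, A_c)
    AtP = matmul(transpose(A_c), P)
    return [[PA[i][j] + AtP[i][j] + (2.0 if i == j else 0.0)
             for j in range(2)] for i in range(2)]


def damping_coefficient(P, A_c_inv):
    """Scalar b^T P A_c^{-1} b with b = [0, 1]^T."""
    b = [[0.0], [1.0]]
    return matmul(transpose(b), matmul(P, matmul(A_c_inv, b)))[0][0]


if __name__ == "__main__":
    A_c, P, A_c_inv = axis_matrices(kp=2.0, ki=4.0)  # hypothetical gains
    print("Lyapunov residual:", lyapunov_residual(A_c, P))
    print("A_c * A_c_inv:", matmul(A_c, A_c_inv))
    print("b^T P A_c^-1 b:", damping_coefficient(P, A_c_inv))
```

For these gains the damping coefficient evaluates to $-1/k_i^2$, independent of $k_p$, in line with the remark that the damping term depends only on the integral gain.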

A simulation of a pitch rate doublet is performed to illustrate the adaptive law (18) with the optimal control
modification. The uncertainty is due to airframe structural damage, which in this case represents a 25% loss of the left wing
of a generic transport model (GTM) as shown in Fig. 5.

Fig. 5 - Damaged Generic Transport Model

Figure 6 is a plot of the aircraft angular rates with only PI control and without adaptive control. Due to the 
asymmetric damage, a pitch rate command results in both roll and yaw rate responses due to cross-coupling effects. 
The response is completely unacceptable due to the excessive roll and yaw rates. 




Fig. 6 - Aircraft Rate Response with PI Control 

Figure 7 is a plot of the aircraft angular rates due to the standard direct MRAC ($\nu = 0$) using a learning rate
$\Gamma = 10^4$. The tracking performance drastically improves in all axes. However, high-frequency oscillations can clearly
be seen in the yaw rate response and to a much lesser extent in the pitch and roll channels. Further increase in the
learning rate results in progressively larger high-frequency amplitudes and eventually leads to a numerical instability
when $\Gamma > 2 \times 10^4$ due to a sampling limitation.

Fig. 7 - Aircraft Rate Response with Standard MRAC

In contrast, the aircraft rate response for the optimal control modification tracks the reference model very well,
as can be seen in Fig. 8. Furthermore, the optimal control modification results in no observable high-frequency
oscillation in spite of the fact that the learning rate is two orders of magnitude greater than that for the standard direct
MRAC. For this simulation, a value of $\nu = 0.033$ is used. A larger value of $\nu$ will degrade the tracking performance
but improve stability robustness. For comparison, the simulation also includes the $e$-modification as shown in Fig. 9.
A value of $\mu = 0.25$ is used with a learning rate $\Gamma = 10^4$. The $e$-modification significantly reduces the high-frequency
oscillations in the yaw rate response, but at the expense of the tracking performance, as the amplitudes in the roll and yaw channels
significantly increase.

Fig. 8 - Aircraft Rate Response with Optimal Control Modification

Fig. 9 - Aircraft Rate Response with e-Modification


The simulation illustrates a potential benefit of the optimal control modification for fast adaptation. In practice, 
there is a limit on how large the learning rate can be. In general, actuator dynamics can impose constraints 
on the learning rate. Insufficient frequency separation between the adaptation and actuator dynamics can lead to 
potential problems. Nonetheless, the optimal control modification tolerates larger learning rates than the 
standard MRAC, which can be beneficial when fast adaptation is needed to deal with large uncertainties. 
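
The trade-off described above can be illustrated on a much simpler problem than the damaged-aircraft simulation. The sketch below is a minimal scalar example, not the simulation used for Figs. 6 to 9. The plant, the gains, the value of the learning rate, and the forward-Euler integration are all hypothetical choices made for illustration. The adaptive law is assumed to take the form $\dot{\Theta} = -\Gamma \Phi (e^\top P - \nu \Phi^\top \Theta B^\top P A_m^{-1}) B$, which should be checked against the adaptive law derived earlier in the paper.

```python
import numpy as np

def simulate(learning_rate, nu, t_end=20.0, dt=1e-3):
    """Forward-Euler simulation of a hypothetical scalar adaptive loop.

    Plant:      x'  = a*x + b*(u + theta_star*x)   (theta_star is unknown)
    Reference:  xm' = a_m*xm + b_m*r
    Control:    u   = k_x*x + k_r*r - theta*x
    Adaptation (assumed form of the modified law, with Phi = x):
        theta' = -Gamma*(x*e*p*b - nu*x*x*theta*b*p*b/a_m)
    Setting nu = 0 gives the standard MRAC law.
    """
    a, b, theta_star = 1.0, 1.0, 0.5   # illustrative plant, not from the paper
    a_m, b_m = -1.0, 1.0               # illustrative reference model
    k_x, k_r = (a_m - a) / b, b_m / b  # matching conditions
    p = 1.0                            # solves 2*a_m*p = -q with q = 2
    r = 1.0                            # unit step command
    n = int(round(t_end / dt))
    x = xm = theta = 0.0
    e_hist = np.empty(n)
    theta_hist = np.empty(n)
    for k in range(n):
        e = xm - x                     # tracking error
        u = k_x * x + k_r * r - theta * x
        x_dot = a * x + b * (u + theta_star * x)
        xm_dot = a_m * xm + b_m * r
        theta_dot = -learning_rate * (
            x * e * p * b - nu * x * x * theta * b * p * b / a_m
        )
        e_hist[k] = e
        theta_hist[k] = theta
        x += dt * x_dot
        xm += dt * xm_dot
        theta += dt * theta_dot
    return e_hist, theta_hist
```

Calling `simulate(100.0, 0.0)` and `simulate(100.0, 0.1)` gives the error and parameter histories for the two laws. In this toy problem the modification with $\nu > 0$ leaves a small steady-state tracking error, since the parameter settles at $\theta^*/(1 + \nu)$ rather than $\theta^*$, which mirrors the statement that a larger $\nu$ degrades tracking performance.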

One of the issues with adaptive control is the lack of metrics to assess stability robustness in the presence of 
unmodeled dynamics and/or time delay. With fast adaptation, it is known that the direct MRAC results in reduced 
phase and time-delay margins. 21 Thus, the learning rate must be chosen carefully in order to avoid instability due to 
time delay and unmodeled dynamics. The optimal control modification is shown to provide more robustness when $\nu$ 
approaches unity. Hence, it can also increase a system’s tolerance to destabilizing uncertainties such as time delay. 

An approximate, simple method for analyzing the stability margin of the optimal control modification is presented. 
Toward that end, the tracking error dynamics can be expressed as 

$$\dot{x}_m - \dot{x} = b^\top \dot{e} \le b^\top A e + b^\top b \left( \tilde{\Theta}^\top \Phi + \delta_\epsilon \right) \tag{97}$$

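From Eq. (75), we get 

$$\tilde{\Theta}^\top \Phi \le \left( s + \gamma \nu K_i^{-2} \right)^{-1} \left( -\gamma b^\top P e + \gamma \nu K_i^{-2} \left\| \Theta^{*\top} \Phi \right\| \right) \tag{98}$$

where $\gamma \le \int \Phi^\top \Gamma \Phi \, d\tau > 0 \in \mathbb{R}$ according to Lemma 1. 
Substituting Eq. (98) into Eq. (97) yields 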


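$$A_m x_m + B_m r - s x \le -\left[ \left( \frac{K_i}{s} + K_p \right) + \gamma \left( s + \gamma \nu K_i^{-2} \right)^{-1} \left( \frac{P_{12}}{s} + P_{22} \right) \right] (x_m - x) + \left( s + \gamma \nu K_i^{-2} \right)^{-1} \gamma \nu K_i^{-2} \left\| \Theta^{*\top} \Phi \right\| + \delta_\epsilon \tag{99}$$

where $P_{12} = K_i^{-1} = \mathrm{diag}\left( k_{i,1}^{-1}, k_{i,2}^{-1}, k_{i,3}^{-1} \right)$, and $P_{22} = K_p^{-1} \left( I + K_i^{-1} \right) = \mathrm{diag}\left( k_{p,1}^{-1} + k_{p,1}^{-1} k_{i,1}^{-1}, \; k_{p,2}^{-1} + k_{p,2}^{-1} k_{i,2}^{-1}, \; k_{p,3}^{-1} + k_{p,3}^{-1} k_{i,3}^{-1} \right)$. 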
Thus, the loop transfer function matrix from $x$ to $x$ is obtained as 

$$H(s) = \frac{1}{s} \left[ \left( \frac{K_i}{s} + K_p \right) + \gamma \left( s + \gamma \nu K_i^{-2} \right)^{-1} \left( \frac{P_{12}}{s} + P_{22} \right) \right] \tag{100}$$

which can be broken into individual loop transfer functions, since $K_p$, $K_i$, $P_{12}$, and $P_{22}$ are all diagonal 
matrices, which implies that $H(s)$ is also diagonal, with elements 


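$$h_i(s) = \frac{k_{p,i} s^2 + \left( k_{i,i} + \gamma \nu k_{p,i} k_{i,i}^{-2} + \gamma p_{22,i} \right) s + \gamma \nu k_{i,i}^{-1} + \gamma p_{12,i}}{s^3 + \gamma \nu k_{i,i}^{-2} s^2}, \quad i = 1, 2, 3 \tag{101}$$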
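
The step from Eq. (100) to Eq. (101) can be written out for a single axis. The following is a worked sketch under the assumption that the $i$-th diagonal entries of $K_p$, $K_i$, $P_{12}$, and $P_{22}$ are denoted $k_{p,i}$, $k_{i,i}$, $p_{12,i}$, and $p_{22,i}$; the shorthand $a_i$ is introduced here only for convenience and does not appear in the original derivation.

$$
\begin{aligned}
a_i &= \gamma \nu k_{i,i}^{-2} \\
h_i(s) &= \frac{1}{s} \left[ \frac{k_{i,i}}{s} + k_{p,i} + \frac{\gamma}{s + a_i} \left( \frac{p_{12,i}}{s} + p_{22,i} \right) \right] \\
&= \frac{\left( k_{p,i} s + k_{i,i} \right) \left( s + a_i \right) + \gamma \left( p_{22,i} s + p_{12,i} \right)}{s^2 \left( s + a_i \right)} \\
&= \frac{k_{p,i} s^2 + \left( k_{i,i} + k_{p,i} a_i + \gamma p_{22,i} \right) s + k_{i,i} a_i + \gamma p_{12,i}}{s^3 + a_i s^2}
\end{aligned}
$$

Since $k_{i,i} a_i = \gamma \nu k_{i,i}^{-1}$, the last line has the form of Eq. (101). As $\nu \to \infty$ with $\gamma$ fixed, $a_i \to \infty$ and $h_i(s)$ tends to $(k_{p,i} s + k_{i,i}) / s^2$, the loop transfer function of the non-adaptive PI controller.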


Figure 10 is a plot of the phase margin of $h_i(s)$ for the pitch rate as a function of $\nu$ for different values of 
$\gamma$. Increasing $\nu$ is shown to result in improved phase margin. Increasing $\gamma$ causes the phase margin at 
$\nu = 0$ to approach zero, but moves the phase margin closer to 90 degrees for $\nu > 0$. The margin at $\nu = 1$ is not 
necessarily the greatest. For large values of $\gamma$, the greatest margin occurs at some value of $\nu < 1$, but it 
differs only slightly from that at $\nu = 1$. The phase margin converges asymptotically to its value corresponding to the 
non-adaptive PI control when $\nu$ is large. Theoretically, the ideal time-delay margin of this transfer function is 
infinite. Realistically, it is expected that the optimal control modification would provide an increase in the 
time-delay margin, which is proportional to the phase margin. 
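
The trends described for the phase margin can be checked numerically from the scalar loop transfer function of Eq. (101). The sketch below is a minimal illustration, not the computation used to produce Fig. 10. The gains `kp = 2.0` and `ki = 1.0` are hypothetical, the function names are chosen here for illustration, and the values $p_{12} = k_i^{-1}$ and $p_{22} = k_p^{-1}(1 + k_i^{-1})$ are taken from the definitions following Eq. (99). The gain crossover is located on a logarithmic grid and refined by bisection, and the time-delay margin is estimated as the phase margin in radians divided by the crossover frequency.

```python
import numpy as np

def loop_tf(s, gamma, nu, kp, ki):
    """Scalar loop transfer function h_i(s) in the form of Eq. (101)."""
    p12 = 1.0 / ki                    # i-th diagonal entry of P_12
    p22 = (1.0 + 1.0 / ki) / kp       # i-th diagonal entry of P_22
    a = gamma * nu / ki**2
    num = kp * s**2 + (ki + kp * a + gamma * p22) * s + ki * a + gamma * p12
    den = s**2 * (s + a)
    return num / den

def margins(gamma, nu, kp=2.0, ki=1.0):
    """Return (phase margin [deg], crossover [rad/s], delay margin [s]).

    kp and ki are hypothetical gains, not values from the paper.
    """
    w = np.logspace(-3, 9, 12001)
    mag = np.abs(loop_tf(1j * w, gamma, nu, kp, ki))
    # Grid intervals where the magnitude falls through unity.
    idx = np.nonzero((mag[:-1] >= 1.0) & (mag[1:] < 1.0))[0]
    if idx.size == 0:
        raise ValueError("no gain crossover found on the frequency grid")
    lo, hi = w[idx[-1]], w[idx[-1] + 1]
    for _ in range(60):               # bisection in log-frequency
        mid = np.sqrt(lo * hi)
        if abs(loop_tf(1j * mid, gamma, nu, kp, ki)) >= 1.0:
            lo = mid
        else:
            hi = mid
    wc = np.sqrt(lo * hi)
    phase_deg = np.degrees(np.angle(loop_tf(1j * wc, gamma, nu, kp, ki)))
    # Phase margin = 180 deg + phase, wrapped into [-180, 180).
    pm_deg = (phase_deg + 360.0) % 360.0 - 180.0
    return pm_deg, wc, np.radians(pm_deg) / wc

if __name__ == "__main__":
    for nu in (0.0, 0.033, 0.33, 1.0):
        pm, wc, tau = margins(1.0e4, nu)
        print(f"nu={nu:5.3f}  PM={pm:7.2f} deg  wc={wc:9.3f} rad/s  delay={tau:.5f} s")
```

With these illustrative gains the numbers will not match Fig. 10 or the delay margins reported for the aircraft simulation; only the qualitative dependence on $\nu$ and $\gamma$ is meant to carry over.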

It should be noted that Fig. 10 should be viewed in a relative sense rather than an absolute sense. The key research 
question is how to select $T_0$, the time window over which the parameter $\gamma$ is to be computed. In Fig. 10, $\gamma$ 
is computed for the entire time interval, which may provide unrealistic estimates of stability margins. One research 
idea that has been suggested is to adjust the learning rate, or in this case the parameter $\nu$, periodically by 
evaluating $\gamma$ for a moving time window during which the system is approximately bounded by an equivalent LTI 
system based on the Comparison Lemma. 21 
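
A moving-window evaluation of $\gamma$ could be sketched as follows. This is an assumption-laden illustration, not the procedure of the paper: it takes $\gamma$ to be the time average of $\Phi^\top \Gamma \Phi$ over a trailing window of length $T_0$, whereas the exact definition follows from Lemma 1, which is not reproduced in this section. The function name, the sampled-data layout, and the rectangular averaging are hypothetical choices.

```python
import numpy as np

def moving_window_gamma(phi, Gamma, dt, T0):
    """Trailing-window average of Phi^T Gamma Phi (assumed definition of gamma).

    phi   : array of shape (n_samples, n_basis), samples of Phi(t)
    Gamma : array of shape (n_basis, n_basis), learning-rate matrix
    dt    : sample period in seconds
    T0    : window length in seconds
    """
    phi = np.asarray(phi, dtype=float)
    Gamma = np.asarray(Gamma, dtype=float)
    # Quadratic form Phi(t)^T Gamma Phi(t) at every sample.
    q = np.einsum("ti,ij,tj->t", phi, Gamma, phi)
    n_win = max(1, int(round(T0 / dt)))
    csum = np.concatenate(([0.0], np.cumsum(q)))
    gamma = np.empty_like(q)
    for k in range(q.size):
        lo = max(0, k + 1 - n_win)    # window is shorter near the start
        gamma[k] = (csum[k + 1] - csum[lo]) / (k + 1 - lo)
    return gamma
```

A shorter window makes the estimate track changes in the excitation more closely, at the cost of a noisier value of $\gamma$ and therefore a noisier margin estimate.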



Fig. 10 - Phase Margin Analysis of Optimal Control Modification 

Figure 11 illustrates the time delay effect on the optimal control modification. A time delay is introduced between 
the aircraft plant input and output to simulate destabilizing uncertainties. For the same learning rate $\Gamma = 10^4$, 
the standard MRAC can tolerate up to 0.004 s time delay before the adaptive law goes unstable. With the optimal control 
modification, the time-delay margin increases to 0.010 s and 0.114 s for $\nu = 0.033$ and $\nu = 0.33$, respectively. 
This is consistent with the observation above that increasing $\nu$ results in improved stability margins. However, this 
comes at the expense of tracking performance, which becomes worse as $\nu$ increases. 





Fig. 11 - Pitch Rate Responses with Time Delay 

IV. Conclusions 

This study presents a new modification to the standard model-reference adaptive control based on an optimal control 
formulation of minimizing the norm of the tracking error. The modification adds a damping term to the adaptive 
law that is proportional to the persistent excitation. The modification enables fast adaptation without sacrificing 
robustness. When the learning rate tends to a very large value, the tracking error dynamics become approximately linear 
in a bounded sense. This is a useful feature that can allow stability of the adaptive law to be studied in the context 
of linear time-invariant systems. The modification can be tuned using a parameter $\nu$ to provide a trade-off between 
tracking performance and stability robustness. Increasing $\nu$ results in better stability margins but reduced tracking 
performance. When $\nu$ approaches unity, the system has a phase shift close to 90 degrees. Simulations demonstrate the 
effectiveness of the modification, showing that tracking performance can be achieved at a much larger learning 
rate than with the standard model-reference adaptive control and that the adaptive law can tolerate a much greater time 
delay in the system. 